Towards Deep Modeling of Music Semantics using EEG Regularizers
نویسندگان
چکیده
Modeling of music audio semantics has been previously tackled through learning of mappings from audio data to high-level tags or latent unsupervised spaces. The resulting semantic spaces are theoretically limited, either because the chosen high-level tags do not cover all of music semantics or because audio data itself is not enough to determine music semantics. In this paper, we propose a generic framework for semantics modeling that focuses on the perception of the listener, through EEG data, in addition to audio data. We implement this framework using a novel end-to-end 2-view Neural Network (NN) architecture and a Deep Canonical Correlation Analysis (DCCA) loss function that forces the semantic embedding spaces of both views to be maximally correlated. We also detail how the EEG dataset was collected and use it to train our proposed model. We evaluate the learned semantic space in a transfer learning context, by using it as an audio feature extractor in an independent dataset and proxy task: music audio-lyrics crossmodal retrieval. We show that our embedding model outperforms Spotify features and performs comparably to a state-of-the-art embedding model that was trained on 700 times more data. We further discuss improvements to the model that are likely to improve its performance.
منابع مشابه
Neural Correlates of Boredom in Music Perception
Introduction: Music can elicit powerful emotional responses, the neural correlates of which have not been properly understood. An important aspect about the quality of any musical piece is its ability to elicit a sense of excitement in the listeners. In this study, we investigated the neural correlates of boredom evoked by music in human subjects. Methods: We used EEG recording in nine sub...
متن کاملFusion of electroencephalographic dynamics and musical contents for estimating emotional responses in music listening
Electroencephalography (EEG)-based emotion classification during music listening has gained increasing attention nowadays due to its promise of potential applications such as musical affective brain-computer interface (ABCI), neuromarketing, music therapy, and implicit multimedia tagging and triggering. However, music is an ecologically valid and complex stimulus that conveys certain emotions t...
متن کاملClassifying music perception and imagination using EEG
This study explored whether we could accurately classify perceived and imagined musical stimuli from EEG data. Successful EEG-based classification of what an individual is imagining could pave the way for novel communication techniques, such as brain-computer interfaces. We recorded EEG with a 64-channel BioSemi system while participants heard or imagined different musical stimuli. Using princi...
متن کاملCombination of Beamforming and Synchronization Methods for Epileptic Source Localization, using Simulated EEG Signals
Localization of sources in patients with focal seizure has recently attracted many attentions. In the severe cases of focal seizure, there is a possibility of doing neurosurgery operation to remove the defected tissue. The prosperity of this heavy operation completely depends on the accuracy of source localization. To increase this accuracy, this paper presents a new weighted beamforming method...
متن کاملA hybrid EEG-based emotion recognition approach using Wavelet Convolutional Neural Networks (WCNN) and support vector machine
Nowadays, deep learning and convolutional neural networks (CNNs) have become widespread tools in many biomedical engineering studies. CNN is an end-to-end tool which makes processing procedure integrated, but in some situations, this processing tool requires to be fused with machine learning methods to be more accurate. In this paper, a hybrid approach based on deep features extracted from Wave...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1712.05197 شماره
صفحات -
تاریخ انتشار 2017